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Abstract. We present an on-line algorithm for maintaining a topological order of a 
directed acyclic graph as arcs are added, and detecting a cycle when one is created. 
Our algorithm takes 0{m 1 ^ 2 ) amortized time per arc, where m is the total number of 
arcs. For sparse graphs, this bound improves the best previous bound by a logarithmic 
factor and is tight to within a constant factor for a natural class of algorithms that 
includes all the existing ones. Our main insight is that the bidirectional search method 
of previous algorithms does not require an ordered search, but can be more general. 
This allows us to avoid the use of heaps (priority queues) entirely. Instead, the deter- 
ministic version of our algorithm uses (approximate) median-finding. The randomized 
version of our algorithm avoids this complication, making it very simple. We extend our 
topological ordering algorithm to give the first detailed algorithm for maintaining the 
strong components of a directed graph, and a topological order of these components, 
as arcs are added. This extension also has an amortized time bound of 0{rn}^ 2 ) per 
arc. 

1 Introduction 

We consider three related problems on dynamic directed graphs: cycle detection; maintaining 
a topological order; and maintaining strong components, along with a topological order of 
them. Cycle detection and maintaining a topological order are closely connected, since a 
directed graph has a topological order if and only if it is acyclic. Previous work has focused 
mainly on the topological ordering problem; the problem of maintaining strong components 
has received little attention. We present a solution to both problems that improves the best 
known time bound for sparse graphs by a logarithmic factor. 

A topological order O of a directed graph is a total order of the vertices such that for 
every arc (v, w), O(v) < O(w). A directed graph has a topological order (and in general more 
than one) if and only if it is acyclic [12]. A directed graph is strongly connected if, for each 
pair of vertices v and w, there is a path from v to w. The strongly connected components of 
a directed graph are its maximal strongly connected induced subgraphs. These components 
partition the vertices. Given a directed graph, its graph of strong components is the multigraph 
whose vertices are the components of the given graph, with an arc (c(v),c(w)) for each arc 
(v, w) of the original graph, where c(x) is the component containing vertex x. Ignoring loops, 
the graph of strong components is acyclic; thus the components can be topologically ordered. 

Given a fixed n-vertex, m-arc graph, a topological order can be found in 0{n + to) time 
by either of two algorithms: repeated deletion of sources [17, 18] or depth-first search [33]. 
The former method extends to the enumeration of all possible topological ordcrings. Strong 
components, and a topological order of them, can also be found in 0(n+m) time by depth-first 
search, cither one-way [33, 7, 10] or two-way [1, 32]. 

In some applications, the graph is not fixed but changes over time. The incremental topo- 
logical ordering problem is that of maintaining a topological order of a directed graph as arcs 
are added, stopping when addition of an arc creates a cycle. This problem arises in incremental 
circuit evaluation [3], pointer analysis [26], management of compilation dependencies [20,22], 
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and deadlock detection [4]. In some applications cycles are not fatal; strong components, and 
a topological order of them, must be maintained. For example, pointer analysis can optimize 
work based on cyclic relationships [24]. 

In considering the incremental topological ordering and strong components problems, we 
shall assume that the vertex set is fixed and specified initially, and that the arc set is initially 
empty. One can easily extend our ideas to support vertex as well as arc additions, with a 
time of 0(1) per vertex addition. The topological ordering methods we consider also easily 
handle arc deletions, since deletion of an arc cannot create a cycle nor invalidate the current 
topological order, but our time bounds are only valid if there are no arc deletions. Maintaining 
strong components under arc deletions is a harder problem; we briefly discuss previous work 
on it in Section 6. 

One could of course recompute a topological order, or strong components, after each arc 
addition, but this would require 0(n+m) time per arc. Our goal is to do better. The incremen- 
tal topological ordering problem has received much attention, especially recently. Marchetti- 
Spaccamela et al. [21] gave an algorithm that takes 0(n) amortized time per arc addition. 
Alpcrn et al. [3] gave an algorithm that handles batched arc additions and has a good time 
bound in an incremental model of computation. Katriel and Bodlaender [15] showed that a 
variant of the algorithm of Alpern et al. takes ©(minjm 1 / 2 log n, to 1 / 2 + (n 2 log n)/m}) amor- 
tized time per arc addition. Liu and Chao [19] tightened this analysis to ©(m 1 / 2 + n 1 / 2 log n) 
per arc addition. Pearce and Kelly [25] gave an algorithm with an inferior asymptotic time 
bound that they claimed was fast in practice on sparse graphs. Ajwani et al. [2] gave an 
algorithm with an amortized time per arc addition of 0(n 2 ' 75 /m), which for dense graphs 
is better than the bound of Liu and Chao. Finally, Kavitha and Mathew [16] improved the 
results of both Liu and Chao and Ajwani et al. by presenting another variant of the algorithm 
of Alpern et al. with an amortized time bound per arc addition of 0(m 1//2 + (nlognj/m 1 ^ 2 ), 
and a different algorithm with an amortized time bound per arc addition of 0(n 2 - 5 /m). 

The problem of maintaining strong components incrementally has received much less at- 
tention. Fahndrich et al. [9] gave an algorithm that searches for the possible new strong 
component after each arc addition; their algorithm does not maintain a topological order of 
components. Pearce [23] and Pearce et al. [24] mention extending the topological ordering al- 
gorithm of Pearce and Kelly and that of Marchetti-Spaccamcla et al. to the strong components 
problem, but they provide very few details. 

We generalize the bidirectional search method introduced by Alpcrn et al. for incremental 
topological ordering and used in later algorithms. We identify the properties of this method 
that allow older algorithms to achieve their respective running times. We observe that the 
method does not require an ordered search (used in all previous algorithms) to be correct. 
This allows us to refine the general method into one that avoids the use of heaps (priority 
queues), but instead uses either approximate median-finding or random selection, resulting in 
an 0(m}t 2 ) amortized time bound per arc addition. The randomized version of our algorithm 
is especially simple. We extend our incremental topological ordering algorithm to provide the 
first detailed algorithm to maintain strong components and a topological order of them. This 
algorithm also takes 0{m 1 ^ 2 ) amortized time per arc addition. 

The body of our paper is organized as follows. Section 2 describes the bidirectional search 
method, verifies its correctness, and analyzes its running time. Section 3 refines the method 
to yield an algorithm fast for sparse graphs. Section 4 provides some implementation details. 
Section 5 extends the algorithm to the problem of maintaining strong components. Section 6 
examines lower bounds and other issues. In particular, we show in this section that on sparse 
graphs our (^(m 1 / 2 ) amortized time bound is tight to within a constant factor for a natural 
class of algorithms that includes all the existing ones. 

2 Topological Ordering via Bidirectional Search 

We develop our topological ordering algorithm through refinement of a general method that 
encompasses many of the older algorithms. By vertex order we mean the current topological 
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order. We maintain the vertex order using a data structure such that testing the predicate 
u x < y" for two vertices x and y takes 0(1) time, as does deleting a vertex from the order and 
reinserting it just before or just after another vertex. The dynamic ordered list structures of 
Dietz and Sleator [8] and Bender et al. [5] meet these requirements: their structures take O(l) 
time worst-case for an order query or a deletion, and O(l) time for an insertion, amortized 
or worst-case depending on the structure. For us an amortized bound suffices. In addition 
to a topological order, we maintain the outgoing and incoming arcs of each vertex. This 
allows bidirectional search. Initially the vertex order is arbitrary and all sets of outgoing and 
incoming arcs are empty. 

To add an arc (v,w) to the graph, proceed as follows. Add (v,w) to the set of arcs out 
of v and to the set of arcs into w. If v > w, do a bidirectional search forward from w and 
backward from v until finding either a cycle or a set of vertices whose reordering will restore 
topological order. 

A vertex is forward if it is w or has been reached from w by a path of arcs traversed forward, 
backward if it is v or has been reached from v by a path of arcs traversed backward. A vertex 
is scanned if it is forward and all its outgoing arcs have been traversed, or it is backward 
and all its incoming arcs have been traversed. To do the search, traverse arcs forward from 
forward vertices and backward from backward vertices until either a forward traversal reaches 
a backward vertex x or a backward traversal reaches a forward vertex x, in which case there is 
a cycle, or until there is a vertex s such that all forward vertices less than s and all backward 
vertices greater than s are scanned. 

In the former case, stop and report a cycle consisting of a path from w to x traversed 
forward, followed by a path from x to v traversed backward, followed by (v,w). In the latter 
case, restore topological order as follows. Let X be the set of forward vertices less than s 
and Y the set of backward vertices greater than s. Find topological orders of Ox and Oy 
of the subgraphs induced by X and Y, respectively. Assume s is not forward; the case of s 
not backward is symmetric. Delete the vertices inlUF from the current vertex order and 
reinsert them just after s, in order Oy followed by Ox- (See Figure 1.) 

Theorem 1. The incremental topological ordering method is correct. 

Proof. We prove by induction on the number of arc additions that the method maintains a 
topological order until it stops and reports a cycle. Consider the bidirectional search triggered 
by the addition of an arc (v, to) with v > w. The search must stop because it eventually runs 
out of arcs to traverse; when this happens, all forward and backward vertices are scanned, 
which means that the second stopping condition holds for any s (although it may hold earlier 
for some particular s). If a forward arc traversal during the search reaches a backward vertex 
x or a backward arc traversal reaches a forward vertex x, then there is a cycle, consisting of 
a path from w to x traversed forward, followed by a path from x to v traversed backward, 
followed by (v,w). 

Conversely, suppose that the addition of (y, w) creates a cycle, consisting of (w, w) followed 
by a path P from w to v. Consider the situation when the search stops. Let x be the first 
non-forward vertex on P and y the last non-backward vertex on P; both x and y exist since v 
is backward (non- forward) and w is forward (non-backward). Let u be the vertex preceding x 
on P and z the vertex following y on P. Then u is forward and z is backward. Traversal of the 
arc (u, x) causes either a cycle to be reported (if x is backward) or x to become forward (if it 
is previously unreached). The latter contradicts the choice of x. Thus cither the search stops 
and reports a cycle, or u is unscanned. Symmetrically, either the search stops and reports a 
cycle or z is unscanned. Since u < z, for any s either u < s or z > s. Thus the search cannot 
stop without reporting a cycle. We conclude that the method reports a cycle if and only if 
the addition of (v, w) creates one. 

Suppose the addition of (v, w) does not create a cycle. Then the search cannot report a cy- 
cle. Thus, for some s, the search will stop with all forward vertices less than s and all backward 
vertices greater than s scanned. We need to show that the new vertex order is topological, 
assuming that the old one was. Assume s is not forward; the case of s not backward is sym- 
metric. In the new order v < w, since v <E X and w e Y(J {s}. Let (x, y) be any arc other than 
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Fig. 1. Restoring topological order after bidirectional search. Since s is not forward, the vertices in 
Y are inserted (in topological order) just after s, followed by the vertices in X. 

(u, w). We do a case analysis based on which of the sets X and Y contain x and y. There are 
nine cases, but two are symmetric and two are impossible, reducing the number of cases to six. 

Case 1. {x, y} C X or {x, y} C Y: x < y in the new order since x and y are both in Ox or 
both in Oy . 

Case 2. x E X and y ^ X: y ^ Y since the graph is acyclic. Since x is scanned, y > s in the 
old order, which implies x < y in the new one. 

Case 3. x € Y and y ^ Y: y ^ X since (x, y) ^ (i>, w). Since x € Y, y > s in the old order, 
which implies x < y in the new one. 

Case 4. x ^ X U Y and y € X: x < y and y < s in the old order, which implies x < y in the 
new one. 

Case 5. x ^ X U Y and y € Y: x < s in the old order since y is scanned and x <^ Y. In the 
new order, x < s < y. 

Case 6. x ^ X U Y and y^XUY: x<j/in both the old and new orders. □ 

Lemma 1. The time per arc addition is 0(1) plus O(l) per arc traversed by the search plus 
any overhead needed to guide the search. 

Proof. The time for the bidirectional search is 0(1) plus 0(1) per arc traversed plus overhead. 
The subgraphs induced by X and Y contain only traversed arcs. The time to topologically 
order them is linear since a static topological ordering algorithm suffices. The time to delete 
vertices in X U Y from the old vertex order and reinsert them is O(l) per vertex in X U Y. □ 

To obtain a good time bound we need to minimize both the number of arcs traversed and 
the search overhead. In our discussion we shall assume that no cycle is created. Only one arc 
addition, the last one, can create a cycle; the last search takes O(m) time plus overhead. 

We need a way to charge the search time against graph changes caused by arc additions. 
To measure such changes, we count pairs of related graph elements, either vertex pairs, vertex- 
arc pairs, or arc-arc pairs: two such elements are related if they are on a common path. The 
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number of related pairs is initially zero and never decreases. There are at most (™) < n 2 /2 
vertex-vertex pairs, nm vertex-arc pairs, and (™) < m 2 /2 arc-arc pairs. Of most use to us 
are the related arc-arc pairs. 

We limit the search in three ways to make it more efficient. First, we restrict it to the 
affected region, the set of vertices between w and v. Specifically, only arcs (u,x) with u < v 
are traversed forward, only arcs (y, z) with z > w are traversed backward. This suffices to 
attain an 0{n) amortized time bound per arc addition. The bound comes from a count of 
newly-related vertex-arc pairs: each arc (u, x) traversed forward is newly related to v, each 
arc (y, z) traversed backward is newly related to w. The algorithm of Marchetti-Spaccamela 
et al. [21] is the special case that does just a unidirectional search forward from w using 
s = v, with one refinement and one difference: it does a depth-first search, and it maintains 
the topological order as an explicit mapping between the vertices and the integers from 1 to 
n. 

Unidirectional search allows a more space-efficient graph representation, since we need 
only forward incidence sets, not backward ones. But bidirectional search has a better time 
bound if it is suitably limited. We make the search balanced: each traversal step is of two arcs 
concurrently, one forward and one backward. There are other balancing strategies [3,15,16], 
but this simple one is best for us. Balancing by itself does not improve the time bound; we 
need a third restriction. We call an arc (u, x) traversed forward and an arc (y, z) traversed 
backward compatible if u < z. Compatibility implies that (u, x) and (y, z) are newly related. 
We make the search compatible: each traversal step is of two compatible arcs. 

Lemma 2. If the searches are compatible, the amortized number of arcs traversed during 
searches is 0{m 1 ^ 2 ) per arc addition. 

Proof. We count related arc-arc pairs. Consider a compatible search of 2k arc traversals, k 
forward and k backward. Order the arcs {u, x) traversed forward in increasing order on u, 
breaking ties arbitrarily. Let (u,x) be the \k/2\ th arc in the order. Arc (u,x) and each arc 
following (u, x) has a compatible arc (y, z) traversed backward. Compatibility and the ordering 
of forward traversed arcs imply that u < z. Thus each such arc {y, z) is newly related to (u, x) 
and to each arc preceding (u,x), for a total of at least (k/2) 2 newly related pairs. 

We divide searches into two kinds: those that traverse at most m 1 / 2 arcs and those that 
traverse more. Searches of the first kind satisfy the bound of the lemma. Let 2ki be the 
number of arcs traversed during the i th search of the second kind. Since 2ki > m 1 / 2 and 
Ei(fci/2) 2 < m 2 /2, £ 2 fc 4 < 2J2 i k t 2 /m 1 / 2 = 8£,(^/2)7™ 1/2 < 4m 3 / 2 . Thus there are 
0{m 1 / 2 ) arc traversals per arc addition. □ 

We still need a way to do a compatible search. The most straightforward way is to make 
the search ordered: traverse arcs (u, x) forward in non-decreasing order on u and arcs (y, z) 
backward in non-increasing order on z. We can implement an ordered search using two heaps 
(priority queues) to store unscanned forward and unscanned backward vertices. In essence this 
is the algorithm of Alpern et al. [3], although they use a different balancing strategy. The heap 
overhead is O(logn) per arc traversal, resulting in an amortized time bound of 0(m x l 2 logn) 
per arc addition. More-complicated balancing strategies lead to the improvements [15, 19, 16] 
in this bound for non-dense graphs mentioned in Section 1. 

3 Compatible Search via a Soft Threshold 

The running time of an ordered search can be reduced further, even for sparse graphs, by using 
a faster heap implementation, such as those of van Emde Boas [39, 38], Thorup [37], and Han 
and Thorup [11]. But we can do even better, avoiding the use of heaps entirely, by exploiting 
the flexibility of compatible search. What we need is a way to find a pair of candidate vertices 
for a traversal step: an unscanned forward vertex u and an unscanned backward vertex z 
such that u < z. It is easy to keep track of unscanned forward and backward vertices, but if 
we have an unscanned forward vertex u and an unscanned backward vertex z, what do we 
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do if u > z? One answer is to (temporarily) bypass one of them, but which one? To resolve 
this dilemma, we use a soft threshold s that is an estimate of the stopping threshold for the 
search. If u > z, at least one is on the wrong side of s: either u > s or z < s (or both). We 
call such a vertex far. Our decision rule is to bypass a far vertex. 

In addition to s, we need two hard thresholds to bound candidate vertices, a low threshold 
I and a high threshold h. We also need a partition of the candidate forward vertices into two 
sets, A and B, and a partition of the candidate backward vertices into two sets, C and D. 
The vertices in B U D are far; we call the vertices in A U C near (they may or may not be 
far). To do a compatible search, initialize I tow, htov, s to v or w or any vertex in between, 
A to {w}, C to {v}, and B, D, X, and Y to empty. Then repeat an applicable one of the 
following cases until A U B or C U D is empty (see Figure 2): 

Case If. A is empty: move all vertices in B to A, set I = s, let s be a vertex in A, and set 
s = min{s, h}. 

Case lb. C is empty: move all vertices in D to C, set h — s, let s be a vertex in C, and set 
s = max{s, I}. 

In the remaining cases A and C are non-empty. Choose a vertex u in A and a vertex z in C. 

Case 2f. u > h: delete u from A. 

Case 2b. z < I: delete z from C. 

Case 3f. max{z, s} < u < h: move u from A to B. 

Case 3b. I < z < min{u, s}: move z from C to D. 

In the remaining cases I < u < z < h. 

Case 4f. All arcs out of u are traversed: move u from A to X. 
Case 4b. All arcs into z are traversed: move z from C to Y . 

Case 5. No other case applies: traverse an untraversed arc (u, x) out of u and an untraversed 
arc (y, z) into z. If x is backward or y is forward, stop and report a cycle. If x is unreached, 
make it forward and add it to A; if y is unreached, make it backward and add it to C. 

If the search stops without detecting a cycle, set s = h if A U B is empty, s = I otherwise. 
Delete from X all forward vertices no less than s and from Y all backward vertices no greater 
than s. Reorder the vertices as described in Section 2. 

This algorithm does some deletions and bypasses lazily that could be done more eagerly. 
In particular, when a vertex is moved from the far set to the near set in Case If or lb, it can 
be deleted if it is on the wrong side of the corresponding hard threshold (by applying Case 2f 
or 2b immediately) , and it can be left in the far set if it is on the wrong side of the new soft 
threshold. The algorithm does traversal steps eagerly: it can do such a step even if one of the 
two vertices involved is far, since all that Case 5 requires is compatibility. The results below 
apply to the original algorithm and to variants that do deletions and bypasses more eagerly 
and traversal steps more lazily. 

Theorem 2. Compatible search with a soft threshold is correct. 

Proof. Compatible search is just a special case of the general method presented in Section 2. 
If the search stops with A U B empty, then all forward vertices less than h are scanned; hence 
the stopping condition holds for s = h. The case of C U D empty is symmetric. □ 

Lemma 3. The running time of compatible search with a soft threshold is 0(1) plus O(l) 
per arc traversed plus 0(1) for each time a vertex becomes near. 

Proof. Each case either traverses two arcs and adds at most two vertices to AUG, or perma- 
nently deletes a vertex from A U C, or moves a vertex from A U C to B U D, or moves one or 
more vertices from B U D to A U C. The number of times vertices are moved from A U C to 
B U D is at most the number of times vertices become near. □ 
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/ s h 

A = {),B={b,d),D= {}, C= [e,c] 




/ s h 

A = {},B={d},D={c},C={) 




1 s h 

A={},B={},D={c),C={) 

Fig. 2. Compatible search via a soft threshold. At the end of the search, AL)B is empty, so s = h — d 
is the final threshold and the vertices are reordered as described in Section 2. 
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The algorithm is correct for any choice of soft threshold, but only a careful choice makes 
it efficient. Repeated cycling of vertices between near and far is the remaining inefficiency 
We choose the soft threshold to limit such cycling no matter how the search proceeds. A good 
deterministic choice is to let the soft threshold be the median or an approximate median of 
the appropriate set (A or C); an e- approximate median of a totally ordered set of g elements is 
any element that is no less than eg and no greater than eg of the elements, for some constant 
e > 0. The median is a 1/2-approximate median. Finding the median or an approximate 
median takes O(g) time [6,31]. An alternative is to choose the soft threshold uniformly at 
random from the appropriate set. This gives a very simple yet efficient randomized algorithm. 

Lemma 4. If each soft threshold is an e-approximate median of the set from which it is 
chosen, then the number of times a vertex becomes near is 0(1) plus 0(1) per arc traversed. 
If each soft threshold is chosen uniformly at random, then the expected number of times a 
vertex becomes near is O(l) plus 0(1) per arc traversed. 

Proof. The value of I never decreases as the algorithm proceeds; the value of h never increases. 
Let k be the number of arcs traversed. Suppose each soft threshold is an e-approximate median. 
The first time a vertex is reached, it becomes near. Each subsequent time it becomes near, it 
is one of a set of g vertices that become near, as a result of being moved from B to A or from 
D to C. The two cases arc symmetric; consider the former. We shall show that no matter 
what happens subsequently, at least eg vertices have become near for the last time. Just after 
s is changed, at least eg vertices in A are no less than s, and at least eg vertices in A are no 
greater than s. Just before the next time s changes, I > s or h < s. In the former case, all 
vertices no greater than s can never again become near; in the latter case, all vertices no less 
than s can never again become near. We charge the group of g newly near vertices to the 
vertices that become near for the last time. The total number of times vertices can become 
near is at most (1 + l/e)(2 + fc), where k is the number of arcs traversed: there are at most 
2 + k forward and backward vertices and at most 1 + 1/e times a vertex can become near per 
forward or backward vertex. 

Essentially the same argument applies if the soft threshold is chosen uniformly at random. 
If a set of g vertices becomes near, the expected number that become near for the last time is 
at least Ei<K 9 /2( 2i )/s = (5/2+ 1 )/ 2 > ff/ 4 if 9 is even, at least \g/2\ +Ei< 4 <L 3 /2j ( 2 *)/.9 = 
\g/2Y/g>g/4 if g is odd. The total expected number of times vertices can become near is 
at most (5/4)(2 + k). □ 

Theorem 3. The amortized time for incremental topological ordering via compatible search 
is 0(m 1 l 2 ') per arc addition, worst-case if each soft threshold is an e-approximate median of 
the set from which it is chosen, expected if each soft threshold is chosen uniformly at random. 

Proof. Immediate from Lemmas 1-4. □ 
4 Implementation 

In this section we fill in some implementation details. We also give alternative implementations 
of dynamic ordered lists and the reordering step. 

We need a way to keep track of traversed and untraversed arcs. We maintain each incidence 
set as a singly- linked list, with a pointer indicating the first untraversed arc. Each time a vertex 
is reached during a search, the pointer for its appropriate incidence list is reset to the front. 
Since a vertex cannot be both forward and backward during a single search, one pointer per 
vertex suffices. 

We also need a way to report cycles that are detected, if this is required of the application. 
For this purpose we maintain, for each forward and backward vertex other than v and w, the 
arc by which it is reached. 

The description of compatible search is quite general; in particular, it does not specify 
how to maintain the sets of candidate vertices, or the order in which to consider candidates. 
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A simple implementation is to store the candidate forward vertices in a deque [17] (double- 
ended queue) F, with vertices in A preceding those in B and a pointer to the first vertex in 
B. The first vertex on F is u. New forward vertices are added to the front of F; bypassed far 
vertices are moved from the front to the back of F. When the pointer to the first vertex in B 
indicates the first vertex on F, this pointer is moved to the back, and I and s are updated. 
A deque R that operates in the same way stores the backward vertices. Both F and R are 
actually steques [13] (stack-ended queues or output-restricted deques [17]), since deletions are 
only from the front. Thus each can be implemented as an array or as a singly- linked, possibly 
circular list. 

The dynamic ordered list implementations of Dictz and Sleator [8] and of Bender et al. [5] 
are two- level structures that store the elements, in our case vertices, in contiguous blocks of up 
to b elements, with b — O(logn), and store the blocks in a doubly-linked list. The blocks have 
numbers that are consistent with the list order, higher numbers for later blocks. The elements 
within a block have numbers that are consistent with the order within the block. Thus each 
element has a two-part number that can be retrieved in O(l) time and used to test order in 
0(1) time, worst-case. The two methods differ in the details of renumbering when insertions 
take place. Deletions require no explicit renumbering and take 0(1) time worst-case. Insertion 
takes O(l) amortized time. This bound becomes worst-case if incremental updating is done. 

For us an amortized bound suffices. Even though the renumbering schemes of Dietz and 
Sleator and of Bender et al. are not too complicated, we can use a simpler two-level structure 
if we are willing to suffer an 0{v}l 2 ) additive overhead per arc addition, which does not 
affect the asymptotic bound. We use a block size of Ol^n 1 ^ 2 ), we completely renumber the 
elements within a block when the block contents change other than by a deletion, and we 
completely renumber the blocks when a block is inserted or deleted. We number the blocks, 
and the elements within a new block, with consecutive integers starting from 1. When a block 
becomes less than half full as a result of deletions, we combine it with one of its neighbors. 
With this method the time for a set of k consecutive insertions is 0(n x l 2 + k), and the 
amortized time per deletion is 0(1). This data structure is essentially a two-level B-tree. 
Using more levels with a smaller bound on block size decreases the additive overhead per arc 
addition but increases the constant factor for order queries. 

We can topologically order the subgraphs induced by X and Y by using either of the two 
linear-time algorithms for static topological ordering mentioned in Section 1. The subgraph 
induced by X contains exactly the vertices less than s that are reachable from w. Thus a 
simple method to order X is to do a depth-first search forward from w, traversing arcs only 
from vertices less than s, and order the reached vertices less than s in decreasing postorder 
[33]. A symmetric depth-first search backward from v orders Y. An alternative to using a 
static topological ordering method is to sort X and Y on the current vertex order. If we use 
binary comparisons, the time to sort is O(fclogfc), where k is the number of vertices sorted. 
This method has the disadvantage that it does not preserve the 0{m}/ 2 ) amortized bound 
per arc addition but incurs a logarithmic overhead. Alternatively we can take advantage of 
the vertex numbering used by the dynamic ordered list structure. Since the number of bits 
needed to represent the vertex numbers is O(logn), we can sort in 0(n 1/<2 + k) time via a 
radix sort with a fixed number of passes; the additive term can be decreased at the expense of 
increasing the constant factor. Radix sorting preserves the 0{m}/ 2 ) bound per arc addition. 

Which implementation of the algorithm is fastest in practice is a question to be resolved 
by experiments. 

5 Maintenance of Strong Components 

A natural extension of our topological ordering algorithm is to the problem of maintaining 
strong components, and a topological order of them, as arcs are added. This problem has 
received much less attention than the topological ordering problem. Pearce [23] and Pearce 
et al. [24] mention extending their topological ordering algorithm and that of Marchetti- 
Spaccamela et al. [21] to the strong components problem, but they provide very few details. 
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We shall describe how to extend the general method of Section 2 and the specific method of 
Section 3 to the strong components problem, while maintaining for the latter the 0(m}l 2 ) 
amortized time bound per arc addition. We describe only the additions and changes needed. 
We begin at a high level and then fill in the implementation details. 

The graph of strong components contains a vertex for each strong component, and, for 
each original arc (v, w), an arc (c(w), c(w)), where c(x) is the vertex for the strong component 
containing original vertex x. The graph of strong components can contain loops (arcs of the 
form (x, x)) and multiple arcs; we allow both, but ignore loops when searching. To maintain a 
topological order of the graph of strong components (ignoring loops) , we extend the method 
of Section 2 as follows: 

To add an arc (v, w), begin by finding c(v) and c(w). Add (c(v), c(w)) to the set of arcs out 
of c(v) and to the set of arcs into c(w). If c(v) > c(w), do a bidirectional search forward from 
c(w) and backward from c{v), but ignore loops and do not stop when a cycle is detected but 
allow vertices to be both forward and backward. Specifically, do the search as follows. When 
about to traverse a loop, put it aside instead of traversing it, so it will never be examined in 
future searches. When reaching a backward vertex x by a forward traversal, do not stop, but 
instead make x forward. When reaching a forward vertex x by a backward traversal, do not 
stop, but instead make x backward. Call a vertex forward scanned if all its outgoing arcs have 
been traversed, backward scanned if all its incoming arcs have been traversed. Continue the 
search until there is a vertex s such that all forward vertices less than s are forward scanned 
and all backward vertices greater than s are backward scanned. 

The addition of (v,w) creates a new cycle, and a single new strong component, if and 
only if at the end of the search some vertex is both forward and backward. If there is no such 
vertex, reorder the vertices as in Section 2. Otherwise, let X be the set of forward vertices 
less than s and Y the set of backward vertices greater than s. The vertex set of the new 
component is a subset of X U Y U {s}. Find the vertex set Z of the new component and 
contract the vertices in Z to form a single new vertex x representing the component. Find 
topological orders Ox and Oy of the vertices in X — Z and the vertices in Y — Z. Delete all 
vertices in X and Y from the vertex order. If s is in the component, replace s by x in the 
order. (Insert x after s and delete s.) Otherwise, insert x just after s if s is not forward, just 
before s otherwise. Reinsert the vertices in X — Z just after x in order Ox and insert the 
vertices in Y — Z just before x in order Oy. 

A simple extension of the proof of Theorem 1 shows that this algorithm is correct. An 
arc can only become a loop once and be put aside once. If a new component is created, the 
time to find it is linear in the number of arcs traversed by the search if one of the linear-time 
static algorithms for finding strong components mentioned in Section 1 is used. (We describe 
a simpler alternative below.) 

We can use compatible search with a soft threshold in this algorithm, the only change 
being to allow vertices to be both forward and backward. The proof of Theorem 2 gives 
correctness. 

Before analyzing the running time of the algorithm, we fill in some implementation details. 
We represent the vertex sets of the strong components using a disjoint set data structure [34, 
36]. Each set has a distinguished vertex, defined by the data structure, that represents the 
set. Two operations are possible: 

— find(x): return the representative of the set containing x; 

— link(x, y): given representatives x and y of two different sets, form their union, and choose 
one of x and y to represent it. 

Originally each vertex is in a singleton set. 

We maintain the arcs in their original form, with their original end vertices. For each 
representative of a component, we maintain the set of arcs out of component vertices and 
the set of arcs into component vertices. These sets are singly linked circular lists, so that 
catenation takes 0(1) time. Each representative also has two pointers, to the first untraversed 
forward arc and the first untraversed backward arc. To process an arc (u, to) during a search, 
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put (v,w) aside (delete it from the incidence list) if find(v) = find(w); otherwise, traverse 
(find(v), find(w)). To form a new strong component, combine representatives into a single 
component by using link to combine the corresponding vertex sets and catenating all the 
forward incidence lists and all the backward incidence lists of the representatives to form the 
incidence lists of the representative of the new component. 

With this data structure the time per link is 0(1) [34,36]. Each catenation of arc lists 
takes O(l) time per link. The total time for all the links and all the catenations of arc lists 
over all arc additions is 0(min{n, m}), which is O(l) per arc addition. There are two finds 
per arc examined during the search. Such an arc is either found to be a loop and never 
examined again, or it is traversed by the search. Lemmas 1-4 and Theorem 3 hold for the 
extension to strong components, not counting the time for finds. The number of finds is 
©(m 1 / 2 ) per arc addition by Lemma 2. Since the bound on the ratio of finds to links is so 
high ((^(to 1 / 2 )), the amortized time for finds is (^(m 1 / 2 ) per arc addition, even if the disjoint 
set implementation uses path compression with naive linking [36]; this bound also holds for 
path compression with linking by rank or size [36]. The total amortized time per arc addition 
is 0{m}' 2 ). 

We conclude this section with three remarks about implementation. First, there is a simple 
way to topologically order X and Y and to find the new strong component when one is created. 
The method extends the topological ordering method mentioned in Section 4. Do a forward 
depth-first search from w, traversing arcs only from vertices less than s. When traversing 
an arc (x,y), if y is backward, make x backward. When the search stops, the vertices in 
X O Z are the vertices reached by the search that are backward; decreasing postorder is a 
topological order of X— Z. A symmetric backward search from v gives the vertices in YDZ and 
topologically orders Y — Z. Vertex s is in the component if it is both forward and backward. 

Second, our representation of strong components does not provide a way to list the vertices 
in each component. To allow this, maintain for each component a circularly linked list of the 
vertices in it: catenating such lists when a new component is formed takes O(l) time per 
link, and the vertices of a component can be listed in O(l) time per vertex by starting at the 
representative (or at any vertex) and traversing the list. 

Third, our method does not maintain a representation of the original graph. To represent 
both the graph of strong components and the original graph, maintain, for each vertex, four 
lists of its incident arcs: outgoing but not yet found to be loops; outgoing loops; incoming but 
not yet found to be loops; incoming loops. The first and third lists are also part of the lists 
of outgoing and incoming arcs of the component representative; when the end of a sublist is 
reached during a search, this can be detected in O(l) time by checking one end of the next 
arc. Each arc is on up to four lists, two of outgoing arcs and two of incoming arcs, but it only 
needs two pointers to arcs next on a list. (See Figure 3.) 

6 Lower Bounds and Other Issues 

We have described an incremental topological ordering algorithm that takes 0(m v l' 2 ) amor- 
tized time per arc addition. In combination with the 0(n 2 5 jm) bound of the second algorithm 
of Kavitha and Mathew, this gives an overall bound of C^minjm 1 / 2 , n 2 5 /m}) per arc addition. 
For m/n = o(logn), our bound is asymptotically smaller than that of Kavitha and Mathew 
(Oim 1 !" 1 + (nlogn)/™ 1 / 2 )), by a logarithmic factor for m/n = 0(1). Although our approach 
is based on that of Alpern et al. [3] and its variants, our method avoids the use of heaps. 
The randomized version of our algorithm is especially simple. Like previous algorithms, our 
algorithm can handle arc deletions, since a deletion preserves the validity of the current topo- 
logical order, but the time bound is no longer valid. Katriel and Bodlaender provide a class of 
examples on which our algorithm runs in fi(m 1 / 2 ) time per arc addition; thus our analysis is 
tight. We have extended the algorithm to the problem of maintaining strong components and 
a topological order of strong components. In this problem, arc deletions are harder to handle, 
since one arc deletion can cause a strong component to split into several smaller ones. Roditty 
and Zwick [29] presented a randomized algorithm for maintaining strong components under 
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Fig. 3. Data structure for maintaining the graph of strong components and the original graph. Only 
incoming arc lists are shown. Component x and y were recently linked; all non-loop arcs in the new 
component are accessible from x. 



arc deletions, but no additions, given an initial graph. The expected amortized time per arc 
deletion is 0(n) and the query time is 0(1), worst-case. Their algorithm does not maintain a 
topological order of the vertices but can be easily modified to do so, with the same bounds. 
If both additions and deletions are allowed, there is no known solution better than running 
an 0(m)-time static algorithm after each graph change, even for the simplest problem, that 
of cycle detection. There has been quite a bit of work on the harder problem of maintaining 
full reachability information for a dynamic graph. See [29, 30]. 

For the incremental topological ordering problem there are a couple of lower bounds, but 
there are large gaps between the existing lower bounds and the upper bounds. Ramalingam 
and Reps [28] gave a class of examples in which n — 1 arc additions force f2(n log n) vertices 
to be reordered, no matter what topological order is maintained. This is the only general 
lower bound. Katriel [14] considered what she called the topological sorting problem, in which 
the topological order must be maintained as an explicit map between the vertices and the 
integers between 1 and n. For algorithms that only renumber vertices within the affected 
region, she gave a class of examples in which 0(n) arc additions cause fi(n 2 ) vertices to be 
reordered. The algorithms of Marchetti-Spaccamela et al., Pearce and Kelly, Ajwani et al., 
and the 0(n 25 ) algorithm of Kavitha and Mathew are all subject to this bound, although 
our algorithm is not. 

For topological ordering algorithms that reorder only vertices within the affected region, 
we can obtain a lower bound of f2(wn 1//2 ) on the total number of vertex reorderings in the 
worst case, assuming m — J?(n). For simplicity assume that to is a perfect square and m 1 / 2 
evenly divides n. Number the vertices from 1 through n in their initial topological order; use 
these fixed numbers to permanently identify the vertices even as the order changes. Begin 
by adding arcs to form to 1 / 2 paths, each consisting of a sequence of k = njm 1 ! 2 consecutive 
vertices. Then add arcs (2k, 1), (3k, I),..., (n, 1), (3k,k + l), (4fc,fc+l), . . . , (n,k + l), (4fc, 2fc+ 
1), (5k, 2k + 1), . . . , (n, 2k + 1), and so on. The total number of arcs added is n — to 1 / 2 to form 
the initial paths plus to 1 / 2 (to 1 / 2 — l)/2, totaling less than to if n < to/2 + 3m 1 / 2 /2. After the 
initial paths are formed, each arc addition forces the reordering of k vertices if the reordering 
is only within the affected region, for a total of fi^m 1 / 2 ) vertex reorderings. The reordering 
is forced: each arc addition causes two paths to change places. (See Figure 4.) 

The bound of fi(nm}/ 2 ) applies to all existing algorithms, including ours; for sparse graphs, 
the running time of our algorithm matches this bound. If we are only interested in minimizing 
the number of vertex reorderings, not minimizing the running time, we can get a matching 
upper bound of 0(nm}l 2 ) on the number of vertex reorderings by doing an ordered bidirec- 
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Fig. 4. The fijnm 1 ' 2 ) vertex reordering construction with n = 12, m = 16. 



tional search that alternates between scanning a forward vertex and scanning a backward 
vertex; a count of related vertex pairs gives the bound. 

Another observation is that if the incident arc lists are sorted by end vertex, our compatible 
search method can be modified so that the total search time over all arc additions is 0(n 2 ): 
stop traversing arcs incident to a vertex when the next arc is incident to a vertex outside the 
hard thresholds. This bound, also, comes from a count of related vertex pairs. Unfortunately, 
keeping the arc lists sorted seems to require more than 0(m 3 / 2 ) time, giving us no actual 
improvement. The 0(n 2 75 )-time algorithm of Ajwani et al. uses this idea but keeps the arc 
lists partially sorted, trading off search time against arc list reordering time. 

The running time analysis of Ajwani et al. and that of Kavitha and Mathew for their 
0(n 2 5 )-time algorithm rely on a linear program to bound the total amount by which vertex 
numbers change. Although the solution to this linear program is 0(n 2 - 5 ), it may not capture 
all the constraints of the problem, and Kavitha and Mathew do not provide a class of examples 
for which their time bound is tight. One would like such a class of examples, or alternatively 
a tighter analysis of their algorithm. 

We have used amortized running time as our measure of efficiency. An alternative way to 
measure efficiency is to use an incremental competitive model [27], in which the time spent 
to handle an arc addition is compared against the minimum work that must be done by any 
algorithm, given the same current topological order and the same arc addition. The minimum 
work that must be done is the minimum number of vertices that must be reordered, which is 
the measure that Ramalingam and Reps used in their lower bound. But no existing algorithm 
handles an arc addition in time polynomial in the minimum number of vertices that must 
be reordered. To obtain positive results, some researchers have measured the performance of 
their algorithms against the minimum sum of degrees of vertices that must be reordered [3] or 
a more-refined measure that counts out-degrees of forward vertices and in-degrees of backward 
vertices [25]. For these models, appropriately balanced forms of ordered search are competitive 
to within a logarithmic factor [3, 25]. In such a model, our algorithm is competitive to within 
a constant factor. We think, though, that such a model is misleading: it docs not account 
for the possibility that different algorithms may maintain different topological orders, it does 
not account for correlated effects of multiple arc additions, and good bounds have only been 
obtained for a model that may overcharge the adversary. 

Alpern et al. and Pearce and Kelly consider batched arc additions as well as single arc 
additions. We have not yet considered generalizing compatible search to handle batched arc 
additions. Doing so might lead improvements in practice, if not in theory. 
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Our algorithm uses a vertex numbering scheme in which all vertices have distinct numbers. 
Alpern et al. allowed vertices to be numbered the same if there is no path between them, in an 
effort to minimize the number of distinct vertex numbers. Our algorithm can be modified to 
include this idea, as follows. Add an extra level to the dynamic list order structure: vertices are 
grouped into those of equal number; each group is an element of a block in the doubly-linked 
list of blocks. (See Section 4.) Start with all vertices in a single group. Having computed X 
and Y after a search, delete all vertices in X and Y from their respective groups, and delete 
all empty groups. If s is not forward, delete all forward vertices from the group containing s 
and add these vertices to X. If s is not backward, delete all backward vertices from the group 
containing s and add them to Y. (If s is neither forward nor backward, do both.) Assign 
each vertex in A to a new group corresponding to its maximum path length (in arcs) from 
w; assign each vertex in Y to a new group corresponding to its maximum path length (in 
arcs) to v. If s is not forward, insert the new groups just after that of s, with the groups of 
Y first, in decreasing order of their maximum path length to v, followed by the groups of X, 
in increasing order of their maximum path length from w. Proceed symmetrically if s is not 
backward. Computing maximum path lengths takes linear time [35], so the overall time bound 
is unaffected. This method extends to the maintenance of strong components. Whether this 
idea yields a speed-up in practice is an experimental question. 
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